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This application claims priority on U.S. provisional application Serial No. 60/270,367, filed 
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li fe BACKGROUND OF THE INVENTION 
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g The present invention relates to identifying and classifying microorganisms, such as bacteria and 

¥£ 

2 * fungi, using magnetic resonance spectroscopy, with multivariate analysis. 

f | Throughout this application, various publications are referenced to within parentheses. 

If | J 

|fi| Disclosures of these publications in their entireties are hereby incorporated by reference into this 

;C3 application to more fully describe the state of the art to which this invention pertains. Full 

>f. * **, 
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bibliographic citations for these references may be found at the end of this application, preceding 
25 the claims. 

Microbial taxonomic classification of micro-organisms involves the grouping of those with like 
characteristics, based on detection of multiple metabolities/compounds or analysis of genetic 
material (DNA), from microbial cells. "Gene trees" derived from sequences of the so-called 
30 ancestral ribosomal DNA gene, can distinguish between and within all living organisms down to 
species and, sometimes, individual strain level. Numerical algorithms and "trees" can also be 
constructed from profiles of microbial metabolities, provided the conditions of culture have been 
carefully standardised. In the medical setting, identification of microbial pathogens allows the 
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clinician to predict and initiate appropriate therapy and to provide prognostic information to 
patients. 

Sites of infected tissue are composites of microbial cells, host immune cells and usually, cells of 
the organ or tissue where the infection is localized. Pathological diagnosis traditionally is time- 
consuming and labour-intensive, being reliant on histopathological examination and microbial 
identification by morphology and culture, or, sometimes, other methods. 

In both clinical and industrial laboratories methods for identification of microorganisms have 
historically been based on multiple phenotypic characters, including morphological features and 
a range of biochemical reactions. These tests are often time consuming and/or relatively 
expensive in their application and some are imprecise. Recently, alternative methods have been 
investigated in an attempt to develop a single, rapid method for characterization and 
identification of micro-organisms. These have included Fourier transform infrared spectroscopy 
(FTIRS) (11) (14), pyrolysis mass spectrometry (PyMS) (12), electrospray ionization mass 
spectrometry (EIMS)(7), UV resonance Raman spectroscopy (UVRRS) (15), and protein 
electrophoresis (16). While reports of these techniques suggest the possibility of rapid and 
reliable identification of some groups of microorganisms, most have been tested with small data 
sets. With the exception of FTIRS they are destructive techniques which analyze cellular 
decomposition products. All have the limitation that they do not directly yield information about 
the biochemistry of the intact viable organism. 
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In contrast, magnetic resonance spectroscopy (MRS) of viable cells can provide information on a 
large range of metabolites. Biological applications of MRS most commonly exploit the non- 
invasive nature of the technique to study aspects of cellular biochemistry in living systems (6). 
However, not all applications of MRS require or include identification of the metabolites 
contributing to the MR spectrum. Pattern recognition techniques, which detect gross spectral 
characteristics associated with a-priori defined classes (such as pathological conditions), have 
been successfully applied to MRS of both tissues and body fluids. Accurate and reliable 
classifiers based on multivariate analyses of *H MR spectroscopic data, have been developed and 
validated for objective diagnosis of thyroid (21), ovarian (23), prostate (9), breast (13), and brain 
tumours (20). In some pathologies MRS is able to detect malignancy before morphological 
manifestations are visible by light microscopy (17). 

Cryptococcosis, caused by C. neoformans, is a potentially life-threatening mycosis of 
immunocompromised and healthy hosts. C. neoformans is the commonest cause of fungal 
meningitis (1) and circumscribed lesions (cryptococcomas) can occur in both lung and brain (2, 
3). Cerebral cryptococcomas have been reported in up to fourteen percent of Australian patients 
presenting with cryptococcosis, depending on host immune status at diagnosis (4). Brian lesions 
are usually diagnosed after C neoformans has been identified in tissue or fluids obtained from 
other body sites, or in cerebrospinal fluid (CSF). 

Brain biopsy is required for diagnosis when lesions are confined to the brain (3) or in the absence 
of other diagnostic material, as the pathology of infective lesions cannot be reliably distinguished 
by modalities such as computed tomography (5) or magnetic resonance imaging (MRI) (6). 
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5 Proton magnetic resonance spectroscopy ( ! H MRS) has been applied to tumours, stroke and 
bacterial infections (7-14). In vivo MRS of the brain was developed and comprehensively tested 
for the diagnosis of human tumours (7, 9, 13) based on initial ex vivo and in vivo studies in 
animal models (15). MRS has identified tumour pathology in human biopsies with a very high 
sensitivity and specificity (16-20). 

10 

hh MR- visible compounds from micro-organisms and/or cells recruited during the host immune 
% S response may give rise to diagnostic and prognostic markers. Extracellular carbohydrates and 

{J. ; 

other products of C. neofbrmans have been identified in CSF from patients with cryptococcal 
fX meningitis (21). Cells of C. neofbrmans are distinguished from those of other invasive fungal 
i?;|5 pathogens by an external polysaccharide capsule, which comprises a high percentage of the 
f \ J biomass in cryptococcomas. The purified capsular material has been studied by H and C MRS 
CI (22). More recent studies have identified extracellular products of C. neofbrmans cultured in 
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vitro using MRS (23). 



20 Pathogenic bacteria and fungi are normally identified and classified on the basis of their cellular 
morphology and biochemistry. Traditional methods are usually time-consuming, as several 
physiological tests are required for unequivocal identification. Where only minor pheno- and 
chemotypic differences exist, as for some species of the yeast genus Candida(64), such tests may 
be difficult or even fail to be definitive. Genotypic methods are more accurate but labor intensive 

25 and expensive. 
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SUMMARY OF THE INVENTION 

It is an object of the present invention to provide a statistical classifier for enabling the 
identificaitn, preferably down to the species group or species level, of various microorganisms. 
As used herein, the term "microorganism" means any microscopic organism (i.e., any unicellular 
or multicellular living entity) including bacteria, fungi, parasites, viruses, protozoa and algae. 

According to the present invention a one-dimensional *H MR spectrum of a microorganism such 
as a bacterial cell suspension provides an overview of hydrogen-containing compounds. 
Consequently, the l H MR spectrum is more representative of the physiology of the cell 
(metabolite pools) than its structure (comprising immobile components such as the cell wall). 
While many different bacterial groups may express and utilize essentially identical metabolic 
pathways, differing levels of enzyme expression and activity in different groups could give rise 
to distinctly different levels of particular metabolities when dissimilar groups are grown in 
similar environments. It was, therefore, proposed that significantly different metabolite pool 
sizes could be detected as differences between the l H MR spectra of the different bacterial 
groups. This was suggested in a previous study comparing selected bacterial ! H MR spectra (5), 
however the small number of isolates examined and the qualitative identification methods 
described in that study did not permit automation or quantitative comparison of the species 
groups. 
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5 According to the present invention, the use of linear discriminant analysis (LDA) on cultures of 
different microorganism isolates, such as different species of bacteria, enabled reliable 
automated identifications of the bacteria species to be made on the basis of their l H MR spectra. 

According to the present invention, a new fingerprinting technique is provided for identification 
10 of microorganisms, bacteria, by combining proton magnetic resonance spectroscopy ( l H MRS) 
!p with multivariate statistical analysis. This has resulted in an objective identification strategy for 
|S common clinical isolates belonging to the bacterial species Staphylococcus aureus, 

Il-.i- 

jp Staphylococcus epidermidis, Enterococcus faecalis, Streptococcus pneumoniae, Streptococcus 

|| i % 

fS pyogenes, Streptococcus agalactiae, and the Streptococcus milleri group. A total of 312 cultures 
If 5 5 of 104 different isolates were examined using ! H MRS. An optimized classifier was developed 

m n 
li Mi 

W using a bootstrapping process and LDA to provide objective classification of the spectra. 
|f Identification of isolates was based on classification of spectra from duplicate cultures and 

./ \ I?* - 

achieved 94% agreement with conventional methods of identification. Less than 1% of isolates 
were identified incorrectly. Identification of the remaining 5% of isolates was defined as 
20 indeterminate. A small number of isolates of Enterococcus casseliflavus and E. gallinarum were 
examined and could be distinguished from E. faecalis, with 96% agreement with conventional 
identification methods. 

According to the present invention, MRS is able to identify metabolites that identify and 
25 distinguish between micro-organisms. When combined with LDA, MRS can identify species of 
pathogenic bacteria that belong to different genera(65). According to the present invention, MRS 
with LDA enabled identification of pathogenic micro-organisms that are taxonomically closer 
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related then shown by Bourne et al(63). For example, five pathogenic Candida species as well as 
the two varieties the pathogenic yeast species Cryptococcus neoformans (var. gattii and var. 
neoformans) can be identified according to the invention. 

According to the present invention, a method for obtaining a statistical classifier for classifying 
microorganisms of unknown species into known species is provided, comprising (a) obtaining a 
plurality of magnetic resonance spectra of each of a plurality of different species of 
microorganisms whose species is known, (b) locating a plurality of maximally discriminatory 
subregions in the magnetic resonance spectra obtained, and (c) cross- validating the spectra by 
selecting a first portion of the spectra from each species, developing linear discriminant analysis 
classifiers from the first portion of the spectra from each species, and validating the remainder of 
the spectra from each species using the classifiers from the first portion of the spectra from each 
species to obtain optimized linear discriminant analysis coefficients and classifier spectra for 
each of the known species of microorganisms, which coefficients and classifier spectra can be 
used to determine the species of microorganisms whose species are unknown. 

According to another aspect of the invention, a method for determining the species of a 
microorganism of unknown species is provided, comprising obtaining magnetic resonance 
spectra of the microorganism of unknown species, and comparing the spectra obtained with a 
species classifier, said classifier having been obtained by (a) obtaining a plurality of magnetic 
resonance spectra of each of a plurality of different species of microorganisms whose species is 
known; (b) locating a plurality of maximally discriminatory subregions in the magnetic 
resonance spectra obtained, and (c) cross-validating the spectra by selecting a first portion of the 
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spectra from each species, developing linear discriminant analysis classifiers from the first 
portion of the spectra from each species, and validating the remainder of the spectra from each 
species using the classifiers from the first portion of the spectra from each species to obtain 
optimized linear discriminant analysis coefficients and classifier spectra for each of the known 
species of microorganisms, and selecting, as the species of the unknown species of 
microorganism, the microorganism whose spectra has the closest match to the spectra of the 
unknown microorganism species. 



According to the present invention, a method for obtaining a statistical classifier for classifying 
microorganisms of unknown species into known species is provided, comprising (a) obtaining a 
plurality of magnetic resonance spectra of each of a plurality of different species of 
microorganisms whose species is known, (b) locating a plurality of maximally discriminatory 
subregions in the magnetic resonance spectra obtained, and (c) cross-validating the spectra by 
selecting a first portion of the spectra from each species, developing linear discriminant analysis 
classifiers from the first portion of the spectra from each species, and validating the remainder of 
the spectra from each species using the classifiers from the first portion of the spectra from each 
species to obtain optimized linear discriminant analysis coefficients and classifier spectra for 
each of the known species of microorganisms, which coefficients and classifier spectra can be 
used to determine the species of microorganisms whose species are unknown. 
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DESCRIPTION OF THE DRAWINGS 

Figure 1 A shows representative l H MR spectra: of E. faecalis, S. milleri, S. pneumoniae and S. 
pyogenes isolates. Refer to Table 3 for the identity of the major metabolites contributing to the 
spectra in each integration region. 

Figure IB shows representative l H MR spectra of: S. epidermidis, S. aureus and S. agalactiae 
isolates. The intense betaine peaks in the spectra of S. aureus and S. epidermidis and the GPC 
peak of S. agalactiae have been truncated to show details of the less intense peaks. The relative 
intensities of the betaine and GPC peaks can be seen in Fig. 2. Refer to Table 3 for the identity 
of the major metabolites contributing to the spectra in each integration region. 

Figures 2A and 2B show Range of measured integral intensities for each species group, 
the bars showing mean ± SD. 

Figure 3 shows: ID *H MR spectra from in vitro cell cultures: A) Crytococcus neoformans, B) 
Candida albicans, C) Aspergillus fumigatus, D) Saccharomyces cerevisiae and E) C6 cell line. 
Identification of the resonances: AA, amino acids, ac, acetate; CH, nonspecific carbohydrate 
resonances; lip, lipids; NCH, contributions from creatine, GABA, lys residues; N(CH3>3, 
contributions from choline containing compounds (chol, PC, GPC), betaine and tau; tre, a,a- 
trehalose. Note the prominent trehalose resonances in the spectrum from C. neoformans, which 
are not distinguishable in the other spectra; 



62620/PJP 



5 Figure 4 shows ID and 2D COSY MR spectra from cerebral rat tissue samples: (a) control brain 
tissue, (b) tissue infected with C. neoformans, and (c) tissue with glioma. Identification of the 
resonances: A-G, triglyceride resonances (32), AA, amino acid residues; ac, acetate; ala, alanine; 
chol, choline; EA, ethanolamine; eth, ethanol; GABA, y-amino butanoic acid; glu/ gin, 
glutamate/ glutamine; GPC, glycerol-phosphocholine; h-tau, hypo-taurine; ile, isoleucine; lac, 
10 lactate; leu, leucine; lip, lipid; lys, lysine; ml, myo-inositol; NAA, N-acetyl aspartate; NCH^, 
contributions from creatine, phospho-creatine, GABA, lysine; N(CH 3 ) 3> contributions from 

-is i 

|| choline containing compounds (choline, PC, gPC), betaine and taurine PC, phosphocholine; PE, 
jf{ phosphoethanolamine; tau, taurine; thr, threonine; tre, -trehalose; val, valine. The listed amino 

la 

ii acids refer to amino acid residues and not necessarily to the respective free amino acids; and 




f i J Figure 5 is a block diagram of a system according to the invention. 
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DESCRIPTION OF THE PREFERRED EMBODIMENTS 

According to the present invention, a method for obtaining a statistical classifier for classifying 
20 microorganisms of unknown species into known species is provided, comprising (a) obtaining a 
plurality of magnetic resonance spectra of each of a plurality of different species of 
microorganisms whose species is known, (b) locating a plurality of maximally discriminatory 
subregions in the magnetic resonance spectra obtained, and (c) cross-validating the spectra by 
selecting a first portion of the spectra from each species, developing linear discriminant analysis 
25 classifiers from the first portion of the spectra from each species, and validating the remainder of 
the spectra from each species using the classifiers from the first portion of the spectra from each 
species to obtain optimized linear discriminant analysis coefficients and classifier spectra for 
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5 each of the known species of microorganisms, which coefficients and classifier spectra can be 
used to determine the species of microorganisms whose species are unknown. 



The method preferably further comprises the step of repeating step (c) a plurality of times, each 
time selecting as the first portion of the spectra a different portion of the spectra from the species, 
10 to obtain a different set of optimized linear discriminant analysis coefficients for the species, and 
obtaining a weighted average of the linear discriminant analysis coefficients to obtain final 
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classifier spectra. The step of cross-validating the spectra preferably comprises cross validating 
the spectra by randomly selecting about half of the spectra. The step of repeating step (c) a 
plurality of times preferably comprises repeating step (c) about 1000 times. 



The method preferably includes the steps of obtaining a plurality of classifier spectra 
independently, and aggregating the results of the independent classifiers to obtain a consensus 
diagnosis. 



20 The microorganisms may include bacteria, including the species of Staphylococcus aureus, 
Staphylococcus epidermidis, Enterococcus faecalis, Enterococcus casseliflavus, Enterococcus 
gallinarum, Streptococcus pneumoniae, Streptococcus pyogenes and Streptococcus agalachae. 
The microorganisms may include fungi, including pathogenic yeasts such as Candida albicans, 
Candida parapsilosis, Candida tropicalis, Candida krusei, and Candida glabrata, and 

25 Cryptococcus varieties of neoformans and gattli. The microorganisms may include cultured 
bacterial infections, and/or specimens from a mammal containing bacterial infections. 
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The plurality of magnetic resonance spectra of each different species is preferably at least 10 ; 
and may be at least 30. 



resonance 



According to another aspect of the invention, a method for determining the species of a 
microorganism of unknown species is provided, comprising obtaining magnetic resonance 
spectra of the microorganism of unknown species, and comparing the spectra obtained with a 
species classifier, said classifier having been obtained by (a) obtaining a plurality of magnetic 
resonance spectra of each of a plurality of different species of microorganisms whose species is 
known; (b) locating a plurality of maximally discriminatory subregions in the magnetic 
spectra obtained, and (c) cross-validating the spectra by selecting a first portion of the 
spectra from each species, developing linear discriminant analysis classifiers from the first 
portion of the spectra from each species, and validating the remainder of the spectra from each 
species using the classifiers from the first portion of the spectra from each species to obtain 
optimized linear discriminant analysis coefficients and classifier spectra for each of the known 
species of microorganisms, and selecting, as the species of the unknown species of 
microorganism, the microorganism whose spectra has the closest match to the spectra of the 
unknown microorganism species. 



The method preferably further comprises the step of repeating step (c) a plurality of times, each 
time selecting as the first portion of the spectra a different portion of the spectra from the species, 
to obtain a different set of optimized linear discriminant analysis coefficients for the species, and 
obtaining a weighted average of the linear discriminant analysis coefficients to obtain final 
classifier spectra. The step of cross-validating the spectra preferably comprises cross validating 
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5 the spectra by randomly selecting about half of the spectra. The step of repeating step (c) a 
plurality of times comprises repeating step (c) about 1000 times. The method preferably 
includes the steps of obtaining a plurality of classifier spectra independently, and aggregating the 
results of the independent classifiers to obtain a consensus diagnosis. 

10 The microorganisms may include bacteria, including the species of Staphylococcus aureus, 
Staphylococcus epidermidis, Enterococcus faecalis, Enterococcus casseliflavus, Enterococcus 
I? gallinarum, Streptococcus pneumoniae, Streptococcus pyogenes and Streptococcus agalachae. 

R The microorganisms may include fungi, including pathogenic yeasts such as Candida albicans, 

! F ^ *5i 

,1.15 Candida parapsilosis, Candida tropicalis, Candida kruseU Candida glabrata, and Cryptococcus 
f II varieties of neoformans and gatfft. The microorganisms may include cultured bacterial 

jj ij 

11* infections, anchor specimens from a mammal containing bacterial infections. 

The plurality of magnetic resonance spectra of each different species is preferably at least 10, 
20 and may be at least 30. 

According to another aspect of the invention, a statistical classifier for classifying 
microorganisms of unknown species into known species is provided, comprising (a) a 
spectrometer for obtaining a plurality of magnetic resonance spectra of each of a plurality of 
25 different species of microorganisms whose species is known, (b) a locator for locating a plurality 
of maximally discriminatory subregions in the magnetic resonance spectra obtained, and (c) a 
cross-validator for cross-validating the spectra by selecting a first portion of the spectra from 

13 



62620/PJP 

each species, developing linear discriminant analysis classifiers from the first portion of the 
spectra from each species, and validating the remainder of the spectra from each species using 
the classifiers from the first portion of the spectra from each species to obtain optimized linear 
discriminant analysis coefficients and classifier spectra for each of the known species of 
microorganisms, which coefficients and classifier spectra can be used to determine the species of 
microorganisms whose species are unknown. 

The cross-validator preferably repeats step (c) a plurality of times, each time selecting as the first 
portion of the spectra a different portion of the spectra from the species, to obtain a different set 
of optimized linear discriminant analysis coefficients for the species, and obtaining a weighted 
average of the linear discriminant analysis coefficients to obtain final classifier spectra. The 
cross-validator preferably cross validates the spectra by randomly selecting about half of the 
spectra. The classifier preferably repeats step (c) about 1000 times. The classifier preferably 
obtains a plurality of classifier spectra independently, and aggregates the results of the 
independent classifiers to obtain a consensus diagnosis. 

The microorganisms may include bacteria, such as Staphylococcus aureus, Staphylococcus 
epidermidis, Enterococcus faecalis, Enterococcus casseliflavus, Enterococcus gallinarum, 
Streptococcus pneumoniae, Streptococcus pyogeries, and Streptococcus agalachae. 

The microorganisms may include fungi, including pathogenic yeasts such as Candida albicans, 
Candida parapsilosis, Candida tropicalis, Candida bruseU Candida glabrata, and Cryptococcus 



14 



62620/PJP 

varieties of neoformans and gattli. The microorganisms may include cultured bacterial 
infections and/or specimens from a mammal containing bacterial infections. 

The plurality of magnetic resonance spectra of each different species is preferably at least 10 and 
may be at least 30. 

A number of examples will be described for identifying different types of microorganisms, in 
particular different bacteria and fungi. However, the present invention is not limited to the 
species of bacteria and fungi in the examples, and can be used for any microorganism capable of 
being identified through the method disclosed herein. 

Example 1 : Bacteria Detection 

1. Storage and culture of bacteria 

Isolates were obtained from the collection of the Centre for Infectious Diseases and 
Microbiology (CIDM), Institute of Clinical Pathology and Medical Research, Sydney and the 
American Type Culture Collection, or were recent clinical isolates from the clinical 
identification laboratory of the CIDM Laboratory services. Stored isolates were suspended in 
10% glycerol in nutrient broth at -70°C. Horse blood agar (HBA) was prepared by addition of 
sterile horse blood to autoclaved blood agar base (Oxoid (UK) or Amyl Media (Australia)). 
Isolates retrieved from storage were subcultured on to 5% horse blood agar and incubated in 5% 
C0 2 for 18-24 hours at 37°C. New isolates and isolates subcultured on HBA after storage were 
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5 streaked onto duplicate HBA plates and incubated at 37°C for 18-24 hours and then stored at 
ambient temperature (20-30°C) for 3-9 hrs before spectroscopy. 

To test for short-term method variability, examined duplicate cultures of all isolates were 
examined. To test for long-term culture and method variability a number of isolates we 
10 recultured up to seven times over an eight month period. Included in the analysis were spectra of 
m 3 isolates of Enterococcus gallinarum and 3 isolates of E. casseliflavus which are closely related 
if! to E. casseliflavus which are closely related to E. faecalis (10) (Table 1). The number of distinct 

:i : A 

JM* isolates examined from each species group, and the number of times the isolate was recultured 

It til 

I P and reexamined can be determined from Table 1 . 

m 

Mil 

• "". 

X'- 1 % 

2:1' 2. Conventional identification of bacteria 

w Staphylococcus aureus was identified on the basis of positive coagulase (using rabbit or human 
plasma) and DNase tests. Staphylococcus epidermidis was identified using the API ID32 staph 
test (BioMerieux, France). Streptococcus and Enterococcus species were identified by 

20 conventional methods - optochin sensitivity (Streptococcus pneumoniae)^ salt tolerance and bile- 
esculin positivity {Enterococcus spp.\ latex agglutination (Streptococcus agalactiae\ and by 
API ID32 strep test (BioMerieux). All tests were carried out according to the manufacturers' 
instructions. In general, isolates were identified only once upon receipt in the microbiology 
laboratory and prior to storage. Some isolates retrieved from storage were reidentified by 

25 conventional tests. 

3. *H MR Spectroscopy 

16 
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5 Bacterial colonies (2-200 mg wet weight) were gently removed from the HBA plate with a 
plastic inoculating loop and suspended by vortex in 0.3 mL phosphate buffered saline (pH 7.2, 
room temperature) made up in D 2 0 (PSB/D 2 0). For most cultures > 80% of cells were scraped 
off the plate. In cases of heavy growth < 10% of cells were harvested, usually from the first 
quadrant. The suspension was immediately transferred to a 5mm susceptibility-matched MR 
10 sample tube (Shigemi, USA). *H MRS measurements were performed at 37°C on a Bruker 
Avance 360 MHz MR spectrometer using ! H/ 13 C 5 mm probe head. ID spectra were acquired 
with acquisition parameters as follows: frequency 360.13 MHz, pulse angle 90° (6-7 ms), 
repetition time Is, 8k data points, 256 or 512 transients, spectral width 3600 Hz, total acquisition 
time 10 or 20 min. The field was locked to D 2 0. Water suppression was effected by a selective 



15 excitation field gradient method (DPFGSE, (3)). Spectra of cells suspended in PBS/D 2 0 were 

hi • 

stable for at least two hours at 37°C. 
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4. Signal assignment 

Two dimensional (2D) homo- and heteronuclear correlation spectra were acquired for at least 
two isolates per species to assign ID MR resonances to specific compounds. {*H, ! H} gradient 

20 COSY experiments were performed in magnitude-mode. Acquisition parameters were: sweep 
width in t 2 3600 Hz, t 2 time domain 2K, 256 increments of 32 or 48 acquisition each, repetition 
time Is. Sine-bell window functions were applied in the ti dimension, and Gaussian-Lorentzian 
window functions were applied in the t 2 dimension. Zero filling was used to expand the data 
matrix to 1 K in the ti dimension. TOCSY spectra with mixing times of 40 ms and 150ms were 

25 acquired with 256 increments of 2K data points and 32 acquisitions (1). { H, C} one-bond 
shift correlation spectra were obtained in the ! H detection mode using a gradient HSQC pulse 
sequence (24). The *H MR spectral width was 3600 Hz and the 13 C MR spectral width was 
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5 15000 Hz. 13 C MR decoupling during acquisition was achieved by GARP-1 (18). The evolution 
time (t t ) was incremented to obtain 400 FEDs, each of 40-64 acquisitions and consisting of 2K 
data points. The repetition time was Is. A sine-bell function was applied in the t 2 dimension and 
a Gaussian-Lorentzian function was applied in the ti dimension. Zero filling to 1 K was used in 
the t, dimension prior to Fourier transformation. {*H, 13 C} gradient Heteronuclear Multiple 

10 Bond Correlation spectra (HMBC) were acquired without proton decoupling using the same 
parameters as for the HSQC experiments except for a 13 C MR spectral width of 20 kHz (24). 

ifc.i,; 

f! One-bond and long-range correlation experiments were usually optimized for Jc,h of 140 Hz and 

5 J 4 

|P n J C)H of 7Hz respectively. ID l H MR spectra were acquired before and after the 2D experiments 
V% to verify absence of metabolic changes. 

m 

115 5. Data processing 

a ■•-■[ 
> ti.il 

i?i % 

|j Spectra were processed using Bruker XWlOTsIMR spectrometer software. Zero filling was 
I; J performed to extend the free induction decay data set to 16K. An exponential window function 
was applied before Fourier transformation yielding a line broadening of 1 Hz. Chemical shift 
calibration was performed by setting the center of the spectrum to 4.64ppm (nominal position of 
20 the water resonance with respect to tetramethylsilane in PBS/D 2 0 at 37°C). Spectra were 
manually phase corrected to achieve a linear and flat baseline. 



Sixteen contiguous fixed integration regions were subjectively chosen on the basis of major 
peaks present in the representative spectra (Fig. 1). The individual integrals were normalized to 
25 the total intensity of the 16 integrals between 4.0 and 0.75ppm. 

6. Linear Discriminant Analysis 
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The table of integrals was imported from Microsoft Excel into STATISTICA (StaSoft Pacific 
P/L, Australia) for LDA. Each of the first 15 of 16 chosen integral regions (see Results) formed 
one independent variable in the LDA (Standard method, 7 groups, Tolerance 0.01, a priori 
classification probability proportional to group size). The 16 th region was omitted because one 
region is redundant for discriminant analysis in a normalized data set. Classification functions 
and classification probabilities were calculated with STATISTICA. 



7. Classification of Spectra and Identification of Isolates 

The following definitions are used herein. The term classification refers to assignment of an 
individual spectrum from a bacterial culture to a species group. Identification refers to 
assignment of an isolate to a species group (on the basis of classification of two independent 
spectra derived from duplicate cultures of the isolate). Correct classification refers to 
assignment of a spectrum to the same species group as conventional classification with a percent 
classification probability > 75%. Misclassification refers to assignment of a spectrum to a 
species group different from conventional classification with a percent classification probability 
> 75%. Indeterminate classification refers to assignment of a spectrum to any species group 
with percent classification probability < 75%. Correct identification refers to assignment of 
both spectra of duplicate cultures according to conventional identification and with average of 
percent classification probability > 75%. Misidentification refers to assignment of both spectra 
of duplicate cultures to the same species group but different from conventional identification and 
with average of percent classification probability >75%. Indeterminate identification refers to 
assignment of spectra of duplicate cultures to different groups, or the same group with average 
classification probability < 75%. 
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An optimized classifier for all 7 species groups was developed based on the Robust Bootstrap 
(RBS) method of Somorjai et al. (19), (2). Starting with all 312 spectra, half the spectra were 
randomly selected from each species group and this training set was used to train the 7-group 
classifier (LDA). The resulting classifier was then used to validate the remaining spectra (the 
test set). This process was repeated B times (with replacement) and every time the optimized 
LDA coefficients were saved. The weighted average of these B sets of LDA coefficients 
produces the final classifier(B = 1000). The weight for the rath set is W m = K m C m 1/2 , m = 1,. . .,B> 
where 0 < C m < 1 is the crispness (defined as the fraction of test samples assigned to a class with 
percent probability 75%, and 0 < K m < 1 is Cohen's chance-corrected measure of agreement (4), 
K m = 1 signifying perfect classification of a test set. The B values W m used for the weights are 
those obtained not for the bootstrap training sets, but for the less optimistic test sets. The 
optimized classifier was then used to classify all 312 spectra. Classifier outcome is reported as a 
percent class probability. 

For separate classification of Enterococcus spp. an optimized classifier was developed based on 
RBS of the 62 spectra of three Enterococcus spp. (B=200, LDA parameters as above). 

The RBS classification software was written using STATISTICA, Microsoft EXCEL and 
Microsoft VISUAL BASIC for APPLICATIONS (VBA) and run on a Pentium-based personal 
computer. 

8. Results 
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1 H MR spectra. Representative spectra of each of the 7 species groups and the 16 integration 
regions chosen for analysis, are shown in Fig. 1 . Spectra of ATCC type strains are shown where 
available, otherwise a spectrum of an isolate close to the group centroid (based on integral 
intensities) of all spectra is shown. The most significant contributing metabolities identified for 
each integration region and used for the statistical analyses, are listed in Table 3. 

While it is not possible to show the range of spectral patterns found in the 30-60 spectra 
examined from each species group, Fig. 2 shows the range of normalized integral intensities 
(mean ± SD) measured for each species group. 

9. Classification of Spectra and identification of Isolates 

Results for classification of 312 spectra and identification of 104 isolates from the seven species 
groups based on the optimized classifier are shown in Table 1. A summary of results in terms of 
classification and identification performance is shown in Table 2. Less than 2% of spectra were 
misclassified and less than 1% of isolates misidentified. There were 13 spectra which had 
indeterminate classification. Of the 9 isolates which showed indeterminate identification 5 were 
correctly identified in subsequent or previous cultures at different dates (the remaining 4 isolates 
were not retained in culture storage and could not be retested). 

The results of an attempt to classify the 14 spectra of three Enterococcus casseliflavus and three 
E. gallinarum isolates separately from E. faecalis are shown in Table 4. Notwithstanding the 
small number of isolates examined, these results suggest that it may also be possible to reliably 
identify E. casseliflavus and E. gallinarum separately from E. faecalis. 

10. Reproducibility of spectra 
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Independent analysis of spectra from concurrent, duplicate, cultures and of isolates retrieved 
repeatedly from storage over a 1-8 month period, confirmed that the classification method is 
robust and not affected by short or long-term procedural variability due to factors such as minor 
changes in culture conditions, number of organisms, or storage of isolates (see Table 1). 

11. Discussion 

a. l H MRS and selection of independent variables for multivariate analysis. 

Visible differences between the typical spectra of some species are readily observed, as seen in 
Fig. 1. However, differences between the spectra of species such as S. pyogenes and S. 
pneumoniae are not obvious by visual inspection and the only possibility of reliably 
distinguishing between such similar groups lies in a multivariate analysis of the data. The initial 
step in such an analysis is the extraction from the spectra, which are comprised of many 
thousands of data points, of a manageable set of independent variables in which any significant 
group differences are manifest. While sophisticated methods have been described for selection 
of optimally discriminating spectral regions (21) a simple division was chosen of all spectra into 
16 contiguous regions visually selected on the basis of peaks present in the spectra illustrated in 
Fig. 1. The advantage of this procedure is that the resultant independent variables may be 
assigned a specific biochemical significance (i.e., an independent variable may be associated 
with a particular metabolite or group of metabolites) if the metabolites contributing to the signal 
in each integration region can be identified. Although Table 3 identifies some of the major 
metabolites contributing to the spectra in Fig. 1, the bacterial identification method applied here 
does not depend on identification or quantitation of the metabolites contributing to the MR 
signal. It is, however, important to note that the measured cellular characteristics on which the 
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classification is based are substantially different from those detected during routine identification 
and also different from those measured by other whole organism fingerprinting techniques. 

b. Classification and Identification Strategy 

Classification based on LDA requires that a set of functions derived by LDA of a training set of 
data be used to classify a test set of data, which is preferably independent of the training set 
(cross-validation). The function of the training set is to describe, in terms of the n independent 
variables derived from the MR spectra, the region of ^-dimensional data space occupied by each 
of the a priori defined groups. If the defined groups in the training set are well separated in data 
space, then the LDA will produce classification functions which assign every member of the 
training set to its a priori defined group. The region of data space associated with a particular 
group will increase with phenotype variation between the members of a particular species group, 
and also with procedural (environmental, biochemical and methodological) variation associated 
with repeated culture and classification of spectra of a specific member of a group. A training 
set comprising only a small number of randomly selected members of a particular group is thus 
unlikely to accurately represent the data space (phenotype range) occupied by all members of 
that species group. If the training set contains only a single measurement of each isolate member 
then it may also not account for procedural variability. Consequently, it is to be expected that 
some misclassifications will occur when a classification function based on a training subset of a 
group is used to classify group members which are not members of the training set. 

For classifier robustness and reliability it is desirable that the number of spectra per species 
group in the training set be 5-10 times more than the number of independent variables (19). 
Such large data set appear to be rare, and are usually difficult to acquire, especially if the derived 
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classifier is to be validated against a test set independent of the training set. The Robust 
Bootstrap method attenuates this problem by allowing cross-validated classifier development 
with all of the available data (19). 



The ease of preparation and examination of duplicate or even triplicate cultures of a particular 
clinical isolate, as used herein, has the advantage that a consensus identification of the isolate 
based on multiple independent analyses, is obtained. This feature of the isolate identification 
strategy has not been applied in other microbial whole organism fingerprinting studies (5, 8) in 
which, at best, only instrument duplicates were acquired. In a few cases the duplicates may be 
incorrectly classified as different species. Consequently, identification based on analysis of a 
single subculture of an isolate cannot be assigned the same confidence level as an identification 
based on classification of independent duplicate cultures. When using conventional methods, 
which report an identification probability based on analysis of a single culture of an isolate, it is 
common practice to reexamine isolates for which the identification probability < 75%. Analysis 
is repeated until a single test returns an identification probability > 75%. By this method it is 
possible that the average identification probability of all tests on an isolate will be < 75% at the 
conclusion of testing. The method of always testing duplicate cultures and requiring that correct 
identification be based on an average probability > 75% imposes a more rigorous and reliable 
identification constraint. In an applied (clinical laboratory) environment, in which testmg of 
duplicates is made a routine procedure, disparity between the assigned group of duplicates 
should be interpreted as an indicator of procedural problems and such isolates should be retested 
or examined with supplementary techniques. 
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Phenotypic variability within species groups was addressed by examination of at least 1 1 isolates 
from each species group. The general success of the classification method used indicates that 
between the species groups there are significant and consistent spectral differences which are 
larger than the typical range of variation within species due to procedure or phenotype. 

C. Classification and Identification results 

The very small number of misclassifications of spectra could not be attributed to any specific 
steps of the method. Potential problems with reproducibility due to short- and long-term 
procedural variability (use of different batches of culture medium, storage of isolates, etc) were 
excluded by undertaking separate analysis of spectra from duplicate cultures of all isolates and 
reculture and reclassification of spectra of 25 isolates, at times up to 8 months after original 
culture and spectroscopy. The single instance of misidentification (S. pyogenes Lab. No. 221- 
2985) may have been the result of contamination. Previous and subsequent tests of the same 
isolate gave correct identification results. There are several characteristics of the method utilized 
herein which point to the robust nature of the identification. 

Firstly, the growth conditions for the samples are not strictly controlled. For example, the 
precise constitution of the growth medium may vary from batch to batch (base media from two 
different manufacturers and multiple batches of horse blood were used). The size of the 
inoculum may vary from plate to plate. Growth of bacteria on an agar plate is inherently 
inhomogeneous, due to crowding and slow diffusion of oxygen and other nutrients through 
colonies and agar. Early experiments with triplicate cultures of all isolates demonstrated a lack 
of variation in spectra from cells grown on single batches of medium. Due to large variations 
between species in the amount of growth obtained overnight on HBA plates (growth of S, milleri 
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was usually very poor) the wet weight of cells resuspended varied from 2-200 mg. As the MR 
signal is directly proportional to sample concentration, poor bacterial growth required only an 
extended number of transients to achieve adequate signal to noise ratio. The phase correction 
and integration steps of spectrum processing, as implemented, required some subjective operator 
input. These deficiencies in the method will introduce some extra variance into the data. They 
may be overcome by use of magnitude spectra and automated integration (23). Other whole 
organism fingerprinting techniques are reported to require strict control of growth media and 
repeated standardization with control cultures (12), (11). 

A sufficient number of isolates in the S. milleri group were examined to attempt an MRS based 
assignment of the isolates to one of the three species within the S. milleri group (S. anginosus, S. 
constellatus, S. intermedins). However, the results demonstrate that the group is physiologically 
homogeneous relative to the diversity of the seven species groups examined. Similarly, the E. 
casseliflavus and E. gallinarum isolates examined are physiologically more similar to E. faecalis 
than to the Streptococcus and Staphylococcus species tested. 

d. Choice of growth medium 

In selecting the most appropriate medium for use in a clinical diagnostic or reference laboratory, 
a universal growth substrate and ease of sample preparation were of prime importance. Since 
HBA is a common medium in use in diagnostic microbiology laboratories and bacterial cells 
could be easily harvested directly from HBA plates without the need for washing, this growth 
medium was chosen as best satisfying the objectives. 
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5 There were major differences between spectra obtained herein and those published for S. aureus 
and E. faecalis grown on trypticase soy sheep blood agar (5). In the latter study, interpretation of 
spectral patterns was reportedly not affected by the choice of growth medium, possibly because 
spectral patterns were inspected visually and distinguished by peak positions rather than peak 
intensities. Growth on or in different media (HBA versus brain hear infusion broth) affected 
10 relative peak intensities due to changes in metabolite pool sizes, much more significantly than 
peak positions, which may be slightly affected by factors such as intracellular pH. 

|| e. Clinical application 

Hi ; 

Though based on a limited set of gram positive bacteria, the results suggest that l H MR 

MM 

|5| spectroscopy of whole cells is of comparable precision and accuracy to established, automated, 

' V 5*" ■ 

it ■ 

i±§ methods of species identification. These include common laboratory systems such as VITEK 

f || (22). The non-destructive nature of the method enables retention of viable organisums post- 
11"^ 

Cl analysis for subsequent checking of contamination or methodological errors. 
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The use of more sophisticated pattern recognition methods than those used herein (see(19)) may 
20 further improve discrimination and allow separate classification within the species groups, albeit 
at the possible expense of easily interpreted biochemical information. For an application 
dedicated to identification rather than characterization, this would be an acceptable compromise. 
The extreme ease of sample preparation, biochemically informative results, rapid automated 
identification, and the robust nature of the method are attractive for clinical and industrial 
25 applications. In practice, this may be of most value for those bacterial species which are 
relatively slow-growing or difficult to identify by conventional methods. 
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The method described is simple, rapid, reliable, and informative. It is reliable because the 
identification result is based on analysis of independently cultured and analysed duplicates. It is 
simple and rapid because a microorganism culture can be easily prepared and analyzed within 20 
minutes. The method is informative because it gives a quantitative estimate of the probability of 
an isolated microorganism belonging to a particular group. 



The method would be applicable to hospital laboratories which require rapid identification of 
infectious agents to expedite appropriate and safe treatment. In industrial situations the method 
would improve process reliability and efficiency. 



Example 2: Using MRS to Distinguish 
Cryptococcomas from Gliomas In Rats 
and Cell Culture 



1. Introduction 



MRS was used to characterize clinical isolates of C. neoformans and a glioma cell-line in culture 
and in experimental rats. ID and 2D *H MR spectra were acquired from fungi cultured in vitro 
(16 isolates of Cryptococcus neoformans, 3 of Candida albicans, 3 of Aspergillus fumigatus, 3 of 
Saccharomyces cerevisiae) and a C6 glioma cell line. Cerebral biopsies were obtained from 
healthy rats and animals with experimental infections or gliomas (19 healthy brams, 19 
cryptococcomas and 20 gliomas). Unequivocal signal assignment was performed for cell 
suspensions as well as tissue samples using homo- and heteronuclear 2D correlation spectra 
(COSY, TOCSY, 'H^C-HSQC and HMBC). The results indicated that MR spectra from C. 
neoformans and cerebral cryptococcomas, but not from other fungi, healthy brain, or gliomas, 
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5 were dominated by resonances from the cytosolic disaccharide a,a -trehalose. This spectral 
pattern was very different from that of gliomas, which was dominated by lipids and an increased 
choline/ creatine ratio, and from healthy brain. The results led to the conclusion that a 
remarkably high concentration of <x,a -trehalose in relation to other MR-visible metabolites is 
diagnostic of C. neoformans. Cerebral cryptococcomas are a uncommon but serious 
10 manifestation of cryptococcosis in humans. Application of these results to the non-invasive 
diagnosis of cerebral cryptococcomas would reduce the risk and expense of unnecessary surgery 

!pi or biopsy and expedite patient management. 

CI 

J % 2. Materials and Methods 

- tr-. 
• J' 1 "'''Pi'. 

Jt5 a. In vitro cultures of fungi and C6 glioma cell line 

|i| Sixteen cryptococcal isolates were cultured in vitro and studied by MRS. These included 8 

w -a 
I! Ml 

%k clinical isolates of C. neoformans serotype A (clinical isolates from lung, blood, CSF and brain), 

CI 

f J 7 isolates of serotype B (clinical (brain and CSF), veterinary and environmental isolates) and 1 
clinical isolate of the teleomorph of C. neoformans, Filobasidiella neoformans var. bacilliformis 

20 (American Type Culture Collection 32609). Other fungi which were cultured in vitro and 
studied by MRS, included 3 each of the yeasts Candida albicans (clinical isolates) and 
Saccharomyces cerevisiae (environmental isolates and type cultures) and the fungus Aspergillus 
fumigatus (clinical isolates). Yeasts were identified biochemically using the API 20C AUX 
system (BioMerieux, March FEtoile, France). Cryptococci were biotyped (45) and serotyped 

25 (Crypto Check agglutination test, Iatron Labs). Fungi were cultured for 24-48 h on Sabouraud's 
dextrose agar (SDA, Difco Labs, Detroit, MI, USA), then either in Brain Hear Infusion broth at 
30°C {A. fumigatus) or in yeast nitrogen broth (Difco) containing 1% glucose, buffered at pH 7.0 
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5 with 0.345% w/ v MOPS (Sigma Chemical Co., St. Louis, MO, USA), at 27, 30 and/or 37°C, C6, 
a rat glioma cell line, was maintained as described (51) and used within 3-30 passages. 
Immediately before use, logarithmic phase fungal cells, or C6 glioma cells, were washed and 
resuspended in dulbecco's phosphate-buffered saline (PBS, Difco) for animal models or in PBS 
made up with 99.5% deuterium oxide (D 2 0 for MRS. 

10 

Culture conditions for the growth of one isolate of C. neoformans (Mc Bride strain) were varied 

;1 - 

13 to test the effect of stress on MR-visible metabolite profiles. The isolates were cultured in 

■ tt * *>- - 

jflf buffered yeast nitrogen broth (Difco Labs, Detroit, MI, USA) containing 10 mM glucose. The 
|| following parameters were varied: incubation temperature (27, 35 and 42°C), pH (5 and 7), 

w\ i ' ml 

0 15 glucose concentration (1, 10, 50 mM), substitution of glucose with mannose (lOmM) or sucrose 

r S *. t&'» 

Jfi| (10 mM) and prolonged incubation in glucose-free medium (0-100hr). 

1^ b. Animal studies 

Three isolates of C neoformans serotype B (WM276, WM430, Mc Bride) and the C6 glioma 
cell line were used for animal experiments. Male Wistar-furth and female Fischer 344 rats (150- 
20 250g, Animal Research Council, Perth, WA) were anaesthetized by inhalation of 4% halothane 
in 100% oxygen prior to intraperitoneal injectin of ketamine (11.6 mg/kg, Apex Laboratories, 
Sydney) and xylazine (1.2 mg/kg, Apex Laboratories, Sydney), and allowed to breathe room air 
spontaneously. For induction of brain lesions, the animal head was fixed in a sterotactic frame 
(David Kopf Instruments, Tajunga, CA, USA); 5 \il of a suspension of C. neoformans serotype 

25 B, or c6 glioma cells, was injected through a straight, flat-ended 26 gauge needle at a rate of 3-6 
|xl/ min. Preliminary experiments established that 5 x 10 4 cfu of cryptococci suspended in a 
volume of 5 \il 9 and 1 x 10 6 C6 cells, also in a volume of 5 jal, induced lesions of at least 3 mm 
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in diameter, when harvested 6-12 days (cryptococcomas, n=18 rats) or 12-30 days (gliomas, n-26 
rats) post-operatively (data not shown). Optimal coordinates for microinjection were; 2.0 mm 
below dura, 3.0 mm lateral, and 2.4 mm anterior-posterior relative to ear bar zero. Using these 
coordinates, rats were injected for the MR study with the C. neoformans serotype B isolate Mc 
Bride (cryptococcomas, n^O, gliomas, n= 19; control, n=19). Control tissue was obtained from 
saline-injected rats. At appropriate times, rats were sacrificed, the brain was removed and cut 
transversely at the site of the lesion. Brain tissue was fixed in formalin, and embedded in 
paraffin. 7 |im sections were taken and stained with haematoxylin and cosin or periodic acid- 
Schiff reagent (PAS) for light microscopy. Brain tissue samples (diameter up to 4 mm) from 
each of the animals with cryptococcomas or gliomas and from controls were suspended in 
PBS/D 2 0, snap-frozen in liquid nitrogen, and stored at 70°C for up to 4 months for MRS 
analysis. 

Animal experimentation was carried out according the Australian National Health and Medical 
Research Council Guidelines and with ethical approval from the University of Sydney Animal 
Ethics Committee. 

7. MR experiments 

J H MR spectra were obtained on a Bruker Avance 360 MHz spectrometer equipped with a 5-mm 
{ H, } inverse-detection dual-frequency probe. The temperature was maintained at 37 C. 
Residual water signal was suppressed by selective gated irradiation (46) or by selective 
excitation using pulse field gradients (47). Chemical shifts were referenced to external sodium 
3-(trimethylsilyl) propanesulfonate (TSP) at 0.00 ppm or internal water (4.65 ppm), respectively. 
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ID l H MR spectra were acquired with a spectral width 3600 Hz, time domain 8k, 128 or 256 
acquisitions, relaxation delay Is. A line broadening of 1 or 3 Hz was applied for cell culture and 
tissue samples, respectively, prior to Fourier transformation. Resonance ratios from fully relaxed 
*H MR spectra were used for comparison of cell types. Packed cell suspensions and samples 
were spun at 20 Hz to avoid the cells settling in the MR tube. A relaxation delay of 5 s was 
applied to allow full relaxation. 

Two-dimensional MR spectra were acquired for unequivocal signal assignment. {*H, ! H} COSY 
experiments were performed in magnitude-mode (48). Acquisition parameters were: sweep 
width in t2 3600 Hz, t2 time domain 2k, 256 increments of 32 or 48 acquisitions each, relaxation 
delay Is. Sine-bell window functions were applied in the ti dimension, and Gaussian-Lorentzian 
window functions were applied in the t2 dimension. Zero filling was used to expand the data 
matrix to 1 K in the ti dimension. Cross-peak volumes were determined as described (49). 

TOCSY spectra with mixing times of 40ms and 120ms were acquired with 256 increments of 2K 
data points and 48 scans per increment for confirmation of assignments (1). 

{ H, li C } one-bond shift correlation spectra were obtained in the H detection mode using an 
HSQC pulse sequence (50) for some samples to confirm signal assignments. The l H MR spectra 
width was 3600Hz and the 13 C NMR spectral width was 15000 Hz. 13 C MR decoupling during 
acquisition was achieved by GARP-1 (8). The evolution time (ti) was incremented to obtain 256 
FIDs, each of 80 acquisition and consisting of 2K data points. The relaxation delay was Is. A 
sine-bell function was applied in the t2 dimension and a Gaussian-Lorentzian function was 
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5 applied in the ti dimension. Zero filling to 1 K was used in the ti dimension prior to Fourier 
transformation. 

'H MR spectra and 2D {'H, 'H} COSY were acquired from all fungal isolates, the C6 glioma 
cell line and rat brain samples (20 with cryptococcomas, 19 with gliomas and 19 controls). 
Signal assignment was confirmed by TOCSY and HSQC for at least one isolate or sample of 
10 each of the four fungal species, the C6 glioma cell line and the different brain biopsy samples. 
8. Results 

u s 

CI a. Cell culture studies 

W Typical one (ID) and two-dimensional (2D) MR spectra from fungi cultured in vitro and the C6 
|1j glioma cell line are compared in Fig. 3. Major cross-peaks from the 2D spectra are summarized 

J§jJ 

T5 m Table 5. Resonances were assigned either by comparison with published data (32, 40, 44, 51, 

r -r ■ t— j 

'f ' \f r 

mi 52) or by primary analysis of COSY, TOCSY and HSQC spectra. Resonances listed in Table 5 

hk and shown in Fig. 3 were present in spectra from all isolates of the respective fungi and the C6 

fpj glioma cell line. Resonance intensities varied between isolates and shown in Table 6. 

20 b. C neoforman cultured in vitro 

One-dimensional and 2D COSY MR spectra of C. neoformans were dominated by resonances 
from lipid and a,a, -trehalose signals. The spectral pattern of lipids was distinctive with chemical 
shifts at 0.90 ppm, 1.30 ppm, 1.60 ppm, 2.00 ppm, 2.30 ppm and 5.38 ppm (51). Resonances at 
3.46 ppm, 3.66ppm, 3.77-3.85 ppm and 5.19 ppm were assigned to a,a, -trehalose using HSQC 

25 spectroscopy on the cell suspensions. The respective assignments were: hi- CI (5.19- 
93.50ppm), H2- C2 (3.66- 71.50ppm), H3- C3 (3.86- 73.0ppm), H4- C4 (3.46-70.00ppm), H5-C- 
5 (3.83- 72.50ppm) and H6-C6 (3.78 & 3.88/ 61.00ppm). Less intense cross-peaks from amino 
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acid residues [lysine (lys), alanine (ala), threonine (thr) and glutamate/glutamine (glu/gln)] and 
ethanol were evident in some strains (summarized in Tables 5 and 6). The dominant MR visible 
extracellular metabolities are acetate (1.92 ppm) and ethanol (1.16 and 3.63 ppm). 

c. Effect of stress on cryptococcal cells 

Since trehalose has been reported to protect fungi against adverse conditions (heat, desiccation, 
osmotic and oxidative stress etc.) (54-57) consideration was given to the possibility that the MR- 
visible trehalose could vary with culture conditions. The effect of temperature, pH, glucose 
concentration, substitution of glucose with other sugars and incubation time as specified in 
Materials and Methods was tested. The respective resonance ratios varied by no more than a 
factor of four relative to standard conditions (data not shown). Lipid and trehalose signals 
remained dominant in ID and COSY spectra, irrespective of culture conditions. 

d. MRS of other fungi cultured in vitro 

MRS of two other clinically important, pathogenic fungi, Candida albicans and Aspergillus 
fumigatus, and the yeast Saccharomyces cerevisiae were investigated, and found to be different 
from C. neoformans. The ID and 2D COSY spectra of C albicans (Fig. 3b) and S. cerevisiae 
(Fig. 3d), revealed dominant lipids, whereas those from A, fumigatus (Fig. 3c) were characterized 
by resonances from amino acid residues and carbohydrates. Carbohydrates resonances from 
these fungi were of a much lower intensity than those from C. neoformans and could not be 
assigned to specfic monosaccharide residues. Very small amounts of MR-visible a,a-trehalose 
(approximately 20 times lower than in C. neoformans) were identifiable only in the COSY 
spectra of two of the three strains of S. cerevisiae and none in C. albicans, if exposed to high 
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temperatures (37-43°C). Ethanol was identified in the two yeast species. Acetate was visible in 
the spectra from the culture supernatants from C. albicans and A.fumigatus (data not shown). 

e. C6 glioma cell line 

The spectra of suspended C6 glioma cells cultured in vitro (Fig. 3e, summarized in Tables 3 and 
4) were dominated by resonances from amino acids. The 2D spectra of C6 cells revealed no 
carbohydrate cross-peaks. Intense cross peaks arising from choline (chol), phosphocholine (PC) 
and glycerol-phosphocholine (GPC) and relatively high amounts of taurine (tau) and the amino 
acid residues leucine (leu) and glu/gln were present, as has been reported for tumour cell lines 
(51). The resonance ratio of 3.25 to 3.05 ppm, representing choline- and creatine-containing 
compounds, respectively, was much higher than that in the spectra from fungi (Table 6). 

f. Animal studies 

Histopathology of biopsy samples from rat brains showed that the biomass of cryptococcomas 
was comprised predominantly of cryptococci, as is seen in human infection and verified the 
pathology of tumours grown in the rat model (data not shown). Representative ID *H MR and 
2D COSY spectra from control rats, rats with cerebral cryptococcoma and rats with gliomas are 
shown in Fig. 4. Resonance ratios are summarized in Table 6. 

Spectra from control brain tissue were dominated by N-acetyl aspartate (NAA) at 2.00 ppm. 
Characteristic signals of lower intensity included composite peaks at 2.0-2.2 ppm (glutamine/ 
glutamate), at 3.0 ppm (creatine, phosphocreatine, (y-amino) butanoic acid (GAB A) and lys 
residues); at 3.2 ppm [N(CH 3 ) 3 groups of choline, PC and GPC etc] and at 3.6 to 3.9 ppm (Ha of 
amino acid residues and /nyo-inositol), amino acids GABA, chol, Pcand GPC signals were also 
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present in COSY spectra. Lactate signals of variable intensity were found at 1.3 ppm, resulting 
from anaerobic metabolism occurring in the time between excision and freezing. 

Cryptococcoma gave ID and COSY MR spectra with the typical pattern of cryptococcal 
trehalose and lipids described above. The intensity of the lipid resonance at 5.38 ppm to the 
trehalose resonance at 5.19 ppm varied over a wide range (Table 6). Furthermore, the 3.2 : 3.0 
ppm resonance ratio was elevated compared with normal brain tissue. The NAA signal 
decreased dramatically and was undetectable in some samples. Other resonances not observed in 
normal brain tissue arose from acetate (ID spectra) and ethanol (2D COSY) in some, but not all 
spectra. Also, a distinct cross peak arose from GPC, which was of much higher intensity than in 
control and glioma spectra. The myoinositol and GABA cross-peak intensities in the COSY 
spectra were reduced relative to the amino acid cross-peaks when compared with those in control 
brain tissue. 

Spectra from tumour biopsies were dominated by lipid signals and an increased resonance ratio 
at 3.20:3.00 ppm, which is consistent with many reports in the literature (32, 58, 59). The 
relative increase in lipid signal intensities and the elevation in the 3.2 : 3.0 ppm ratio was, for 
most samples, much larger than the increase found in cryptococcomas (Table 6). Resonance 
ratios for the lipid varied. NAA remained undetectable in many tumour specimens, indicative of 
absent neuronal activity. The only cross-peaks apart from lipids, that increased relative to other 
amino acid residue cross-peaks (e.g. lys, leu, etc.) were those of taurine (3.28-3.50 ppm), choline 
(3.50-4.07 ppm), PC (3.61-4.19 ppm) and phosphoethanolamine (3.22-3.98 ppm). 
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9. Discussion 

C. neoformans was distinguished unequivocally from other yeasts, the filamentous fungus A. 
fumigatus and C6 glioma cells by MRS, due to an abundance of the non-reducing disaccharide, 
trehalose. These differences were recorded in spectra from cells cultured in vitro, as well as 
from affected tissue from rat cortex, where the diagnosis was confirmed histologically. MRS has 
thus provided a means of distinguishing cryptococcomas from healthy brain and brain tumour 
tissue in biopsy samples. 

(C. neoformans in infected tissued is surrounded by a capsule which typically occupies many 
times the volume of the fungal cell. This capsule is composed predominantly of 
glucuroxylomannans (GXM) in a loosely-woven fibrillar configuration (42, 43). None of the 
cell-associated GXM from samples sultured in vitro, or from tissue biopsies, were MR-visible, 
indicating that native capsular polysaccharides are not sufficiently mobile to be visible using 
MRS. 

In contrast, the cytosolic compound, trehalose, was identified and present in very large amounts. 
Trehalose is present in yeasts, and other fungi (60) and is therefore not a unique characteristic of 
c. neoformans, per se. It is an important protectant induced by conditions of heat (57), osmotic 
stress (61), dehydration (56), dessication and other (for review see (54) and (55)). Trehalose 
levels in C. neoformans cultured at 25°C, however, exceeded those n S. cerevisiae under 
conditions of heat stress (37°C) by at least 20 times. Altered culture conditions did not reduce 
the intensity of the trehalose signal in C. neoformans. 
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The large amount of trehalose relative to other MR-visible compounds in the spectra of C. 
neoformans defines trehalose as one marker that can be used to distinguish C neoformans from 
other fungi. It is possible that such high levels of trehalose in cryptococci are an evolutionary 
response to environmental stress, particularly temperature, dehydration and starvation. 
Adaptation to survival and growth at physiological temperatures, a recognized virulence 
determinant of C. neoformans (62) is consistent with high intracellular concentrations of 
trehalose. 

Bacterial metabolities, but not a,a -trehalose have been identified by *H MRS in pus samples 
from patients with bacterial brain abscesses (5, 34, 35, 39). Ethanol, a product of glucose 
fermentation in yeasts, was reported to be present in the CSF of a patient with ciyptococcal 
meningitis (63). The predominant extracellular metabolites (acetate, ethanol) found in the 
present study and by Bubb et al (44) are not suitable for definitive in vivo or ex vivo diagnosis of 
cryptococcomas as they are also produced by other pathogenic microorganisms (5). The 
distinctive acetate signal present in many crytococcomas, but not healthy or neoplastic tissue, 
may, however, be a useful diagnostic indicator of infection. Acetate is produced by bacteria and 
has been identified by MRS in bacterial abscesses (34, 35, 39) as well as in this study, in C. 
neoformans and cryptococcomas. 

The results discussed above indicate that MRS can distinguish unequivocally between healthy 
brain, and experimental crytoccocomas and gliomas from rats. The high level of MR visible a,a 
-trehalose recorded from cyptococcomas provides a basis for the path for the pathological 
diagnosis of cerebral cyptococcomas. When this method is applied to in vivo diagnosis of 

38 



62620/PJP 



cerebral cryptococcomas in humans, cerebral cyptococcomas will no longer be mistaken for 
malignancies by conventional imaging modalities. An early and correct diagnosis will reduce 
the high morbidity and mortality (27) that occurs when diagnosis is delayed. Using in vivo MRS 
as a non-invasive method of diagnosis of infective lesions in the brain will reduce the risk and 
expense of unnecessary surgery or biopsy and expedite patient management decision. 
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Example 3; 
Identification of Pathogenic Fungi 



1. Microorganisms 

Two hundred and five cultures of the pathogenic yeasts Candida albicans, C. parapsilosis, C. 
tropicalis, C. krusei and C. glabrata were cultured for 48h on Sabouraud's dextrose agar at 30°C. 
69 and 70 isolates of the pathogenic yeast Cryptococcus neoformans var. neoformans and var. 
gattii, respectively, were cultured for 48h on Sabouraud's dextrose agar at 37°C. Yeasts were 
20 identified biochemically using the API 20C AUX system (BioMerieux, Marcy l'Etoile, France). 
Cryptococci were biotyped and serotyped (Crypto Check agglutination test, Iatron Labs). 
Additionally, PCR fingerprinting was used to compare the genotype of the respective species. 
Colonies were scraped from the plates and resuspended in PBS/ D20 immediately before the 

MR experiments. 

25 2. MR spectroscopy 

MR spectra were acquired on a 360 MHz Bruker Avance NMR spectrometer using an 5 mm {1 
H, 13C} inverse detection probe. Signal assignment was performed using COSY, TOCSY (tm 
=40, 150 ms), 1H, 13C HSQC (optimized to U=130 Hz) and 1H, 13C HMBC (optimized to n 

30 J=7Hz). 
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3. Statistical Classification Strategy (SCS)(9): The five Candida species were subdivided into 
two groups containing cultures from 2 or 3 species, respectively. Pair-wise classification was 
performed to distinguish between these groups and later to distinguish between the species in 
each group. Pair-wise classification was performed for the two G neoformans varieties. 
Magnitude MR spectra were normalized to the total integral and analyzed by a genetic- 
algorithm-based Optimal Region Selector(66) to identify three maximally discriminatory 
subregions, using rank-ordered first derivatives of the spectra(9). Using these three regions, a 
Linear Discriminant Analysis based classifier was developed. The robustness of the classifier 
was tested by bootstrap-based(67) crossvalidation (1000 repeats)(9). Class assignment was called 
crisp if class probabilities were > 0.75. 

4. Results 

MR spectra of all yeasts (Candida species as well as Cryptococcus varieties) were dominated by 
signals of lipids, carbohydrates (trehalose, glucose), polyols (glycerol, mannitol, glucitol and 
others), ethanol and amino acid residues. 

a. Candida species: Visual analysis of the ID MR spectra of Candida species allowed a 
distinction into two groups: (A) C. krusei and C. glabrata; (B) C. albicans, C parapsilosis and 
C. tropicalis. These differences were mainly due to a higher carbohydrate and ethanol content in 
group A. These groups are consistent with a phylogenetic tree based on partial actin gene 
sequences64. Pair-wise classification between groups A and B and between species in each 
group resulted in overall accuracies of up to 99%. 

b. Cryptococcus varieties: Both ID and 2D MR spectra from the G neoformans varieties were 
indistinguishable by visual inspection. However SCS on ID MR spectra distinguished the 
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varieties neoformans and gattii with an accuracy of 98.6 % using a pair-wise statistical 
classification strategy. 
5. Discussion 

MRS data analyzed by SCS were compared with currently used biochemical identification tests 
and molecular biological methods (PCR fingerprinting). Different species and varieties of fungi 
could be identified by applying a Statistical Classification Strategy to MR spectra. Thus 
identification of fungi was possible below the taxonomic rank of species with a high degree of 
accuracy in these particular systems. The rapidity with which the SCS algorithm could analyze 
the MR data, collected and processed in less than 10 minutes, makes this an attractive option for 
routine testing in microbiology laboratories. Thus, according to the invention, SCS-based 
analysis of MR data can identify both species and varieties of fungi faster than other 
conventional methods, with a high degree of accuracy. 
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Figure 5 shows a spectrometer 10, which may be a Bruker Avance 360 MHz MR spectrometer, 
with equipped computer. The statistical classification strategy (SCS) computer 12 stores the 
SCS and other programs described therein. The clinical data base includes the information from 
the data acquisition and the identity of known microorganisms and the like which is used by the 
computer 12 to develop the classifier 16. 

Although the examples described relate to in vitro analysis, the present invention can be used for 
in vivo analysis, in which case a more powerful magnet may be obtained in the spectrometer. 

Although at least one embodiment of the invention has been shown and described, variations and 
modifications may occur to those skilled in the art. The invention is not limited to the preferred 
embodiment, and its scope is determined only by the appended claims. 
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Abstract of the Disclosure 

A statistical classifier identifies microorganisms, such as bacteria and fungi, using 
magnetic resonance spectroscopy, with multivariate analysis. 

The bacteria may include species within Staphylococcus, Enterococcus and 
Streptococcus. The fungi may include pathogenic yeasts including species with Candida 
and Cryptococcus. 
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Table 2. Summary of Classification 
and Identification Results 



Classification Type 


Count 


% of total 


Correct 


294 


94.2 


Indeterminate 


13 


4.2 


Misclassification 


5 


1.6 


Total: 


312 


100.0 








Identification Type 


Count 


% of total 


Correct 


146 


93.6 


Indeterminate 


9 


5.8 


Misidentiftcatlon 


1 


0.6 


Total: 


156 


100.0 
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